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Comparison of Direct and Projective Methods of Personality 
Assessment Under Different Conditions of Motivation’ 


ANTHONY DAvIbs 


Brown University 


AND HENRY PILDNER, JR. 


FCTIVE personality assessment is 
the vital link between personality 
theory and human behavior. Theory 
without empirical validation is of little 
value to the experimentalist or the clin- 
ician, unless it leads to research and ap- 
plication. With the present state of 
knowledge in the area of personality as- 
sessment, psychologists are faced with a 
host of crucial, yet far from completely 
understood, problems. Handling the 
seemingly infinite variabilities of the in- 
dividual human in an integrated, orderly 
fashion has proved immensely difficult, 
and it will require much further research 
before the mysteries of accurate psycho- 
diagnosis are solved. 

Two major approaches to the measure- 
ment of personality have developed out 
of the innumerable investigations of hu- 
man motivation, psychic mechanisms, 

‘This study was completed in April 1955, at 
which time the first author was a faculty member 
and the second author was a student in the De- 
partment of Social Relations at Harvard Univer- 
sity. The experiment was conducted at the Har- 
vard Psychological Clinic Annex and was facili- 
tated by research funds granted to Henry A. 
Murray by the Rockefeller Foundation and the 
National Institute of Mental Health (Grant M- 
joo). It was also facilitated by a grant to the 
first author from the Harvard Laboratory of 


Social Relations. In addition to our gratitude to 
Henry A. Murray, we wish to express our ap- 


preciation to Marjorie C. Ingalls, Carol Paige, 
and Veronica Peabody, who provided invaluable 
assistance in the execution of this research 


Columbia University School of Law 


The first’ in- 


volves simply the straightforward evalu- 


and mental aberrations. 
ation of behavior as observed through 
external appraisal and reported intro- 
spection. With this approach, the mature 
individual is regarded as in the main a 
rationally functioning organism, operat- 
ing as a result of clearly focused mo- 
tivational energy, and capable of self- 
comprehension. An individual is consid- 
ered abnormal when he loses self{-com- 
prehension because of repressed and 
misdirected motivation and consequent 
loss of ego dominance. Normal and ab- 
normal behavior are viewed as separate 
psychological phenomena, each having 
distinctive functional patterns. 

The second approach to personality 
was stimulated by the rise of the psycho- 
analytic school of psychological thought, 
which stressed human irrationality and 
determinism. The emphasis upon un- 
conscious, instinctual motivation and the 
dictatorial influence of early childhood 
learning experiences upon adult person- 
ality allowed even the mature, healthy 
individual little self-understanding. The 
psychoanalytic scheme of human logic 
harshly contradicted the dignified intel- 
lectual conceptions of human rationality 
and mental order. According to these 
rules of psychodynamics, personality as- 
sessment cannot be successfully accom- 
plished by merely investigating manifest 


scious motivational systems which un- 
derlie all overt behavior and direct per- 
sonality dynamics must be revealed in 
order to achieve real understanding. The 
introduction of projective methods of 
personality assessment sought to gain 
this understanding. Based on the psycho- 
analytic mechanism of projection, these 
assessment techniques are designed to 
elicit information de- 
fensively repressed into unconsciousness 
by the individual. By presenting un- 
structured, ambiguous stimulus material 
to be interpreted by the individual, the 
projective instrument produces responses 
reflecting unique personality make-up. 
“It is assumed in these techniques that 


which has been 


the individual organizes events in terms 
of his own motivations, perceptions, at- 
titudes, ideas, emotions, and all other 
aspects of his personality” (3, p. 5). 
Strangely enough, the apparently efh- 
cient combination of direct and projec- 
tive approaches to personality measure- 
ment has proved a mixed blessing. Aside 
from their presumably distasteful philo- 
sophical connotations, projective tech- 
niques have presented serious practical 
problems relating to scoring, cost of ad- 
ministration, and structural design. One 
problem of exceptional significance is 
that of intervalidation between projec- 
The 
question of whether projective and di- 


tive and direct assessment measures. 


rect. techniques both make essentially 
the same appraisal of the normal indi- 
vidual has important practical and theo- 
retical implications. Comparison of di 
rect and projective testing results has 
revealed frequent contradictions, and as 
a result there has been an increasing in- 
clination to regard the two methods as 
separate entities, each measuring a dif- 
ferent aspect or level of personality. This 
hasty and imprudent dichotomization 
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behavior and_ self-report. The uncon- 


has undoubtedly impeded progress to- 
ward a unified approach to personality 
assessment and has unnecessarily con- 
fused an issue strongly in need of clari- 
fication. 

Allport, in a recent publication (1), 
has discussed the unreasonable separa- 
tion of direct and projective personality 
assessment measures and the growing 
tendency to ignore the information 
gained through direct measures in favor 
of that gained by means of the devious, 
more complex projective techniques. 
The trend of motivational theory in 
modern dynamic psychology has been 
toward irrationality and geneticism, ac- 
cording to Allport, and away from accept- 
ance of healthy, adult, psychogenic mo- 
tives at their face value. Allport asserts 
that both direct and projective measures 
will reveal the same information about 
the motives of the normal, well adjusted 
personality. In dealing with the neurotic, 
however, whose motives are repressed 
and displaced by fears and _ hostilities, 
projective techniques are best able to 
elicit repressed material, and informa- 
tion may be obtained which is contra- 
dictory to conscious report. 

Thus the direct responses of the psychoneurotic 
cannot be taken at their face value. The de 
fenses are high, the true motives are hidden 
and are betrayed only by a projective technique. 
The normal subjects, on the other hand, tell 
you by the direct method precisely what they 
tell you by the projective method. They are all 
of a piece. You may therefore take their mo 
statements at face value, for even if 


you probe you will not find anything substan 
tially different (1, p. 110) 


tivational 


\llport further stresses the need for the 
complementary usage of both direct and 
projective methods in order that projec- 
tive measures can be properly interpre- 
ted and a maximally accurate personal- 
ity assessment be made. The cogent argu- 
ments set forth by Allport concerning 


direct and projective test consistency 


2 


among normal individuals and his de- 
fense of the validity of direct personality 
assessment of normals have stimulated 
the present experimental research and 
study of the relevant literature. 


THE PROBLEM 


Background and Relevant Literature 

Investigators have long inter- 
ested in the effectiveness of direct tech- 
niques such as questionnaires, autobiog- 
raphies, self-rating scales, and personality 
inventories, in obtaining a veridical per- 
sonality picture. On the basis of empiri- 
cal findings, the validity of these direct 
methods of assessment has frequently 
been challenged. 


Ellis (14), in an exhaustive survey of the litera- 
ture, reviewed a total of 247 validity studies of 
personality questionnaires. In summarizing his 
findings, he concludes that “. . . group adminis- 
tered paper and pencil personality question- 
naires are of dubious value in distinguishing 
between groups of adjusted and maladjusted 
individuals and that they are of much less value 
in the diagnosis of individual adjustment or 
personality traits” (14, p. 426). 

Frank (18), in a study intended to determine 
the comparative stability of response to items on 
a personality inventory, administered the Bern 
reuter Personality Inventory to prison inmates 
some of whom were suffering minor personality 
disorders. The subjects’ high degree of response 
vacillation and inconsistency led Frank to report 
that until the influence of situation and 
the various factors in motivating responses can 
be definitely stated, the personality questionnaire 
will remain a dubious clinical instrument” (18, 
$23) 

Landis (34) and Dudycha (13) both found di- 
rect measures unsatisfactory as distinguishing 
measures. Landis administered three inventories 
to a group of normals and a group of mental 
hospital patients, and Dudycha matched the 
results of self-ratings with observer evaluations, 
using groups of “dependable” and “undepend 
able” students. Results in both studies were in 
consistent and contrary to logical prediction. In 
comparing the results of a personality inventory 
and rating scales administered to a group of 
high school students, Jackson (29) found low in- 
tercorrelations and diversity of test performance 

\ number of studies have tested the fakability 
of direct measures and conscious influencing of 
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results by 
Terman (30) instructed a group of college stu- 


test the subject. Kelly, Miles, and 
dents to distort their the Stanford 
Masculinity-Feminity Test in a given direction. 
In comparing the results with those previously 
obtained from the same group taking the test 
honestly, it was evident that the subjects were 
able to alter their scores considerably. Kimber 
(32) obtained a significant difference in scores on 
the California Test of Personality from college 
students instructed, first, to answer honestly and, 
then, as well adjusted, ideally happy people. In 
a personality study of 
Spencer (56) questioned the subjects about the 
test The subjects 
generally admitted that they would have been 
truthful had they not been guaranteed 
anonymity. Mais (42) and Longstaff (40) both 
have demonstrated the fakability of the Jurgen- 
son Classification Inventory. both studies, 
subjects were instructed to attempt to distort 
their responses in a certain direction. Hunt (27) 
obtained significant distortion on the MMPI 
from subjects instructed to “fake good” and to 
“fake bad.” Wesman (61) administered the Bern- 
reuter Inventory to a group of college students 
in a hypothetical employment situation. The sub- 
jects, pretending ta be job applicants, performed 
distinctly average on adjustment 
The problem of faking and response 
distortion has concerned test designers and clini- 
cians alike, and several schemes for detecting and 
controlling for distortion or personality question- 
naires have been devised, such as the K factor in 
the MMPI developed by Meehl and Hathaway 
44) 

Most of the fakability studies in the existing 
been conducted 
The 
students ex- 


scores on 


high school students, 


honesty of their responses 


less 


above good 


criteria. 


unfortunately 
in highly artificial laboratory 
majority have employed college 
plicitly instructed to distort their responses to 
personality measures or to deliberately influence 
scores certain (39). Two 
studies have managed to avoid this limitation 
Thurstone Person 


literature have 


conditions 


according to criteria 
Schott (55) administered the 
ality Schedule to three groups of male subjects 
neuropsychiatric patients, applicants for profes 
sional positions, and normals, Comparing these 
three groups with the group of college freshmen 
tested by Thurstone, Schott found that the job 
applicants obtained above average good adjust 
ment while the group and the 
college freshmen both obtained average scores. 
Schott regards the scores of the job applicants 


scores, normal 


as unreliable indices of emotional adjustment 
because of the “The 
applicants evidently knew what answers should 
accordingly 


motivation conditions 


produce the best and 
their scores are better measures of sophistication 
than of the degree of emotional stability” (55, 


p. 238). 


impression 


ye 


Moving closer to actual employment assess- 
ment conditions, Herzberg (24) compared the 
performances of three groups on the Guilford- 
Zimmerman test: an industrial group consisting 
largely of college graduates, a group of voca- 
tional guidance clients about half of which were 
college students or graduates, and a group of 
college students. The industrial group produced 
the highest good adjustment scores, and the 
scores of all the groups seemed to vary accord- 
ing to educational level, the subjects with higher 
education presumedly having greater sophisti- 
cation and insight into the nature of the test 
items. From this evidence Herzberg concludes 
that clients for employment or promotion do 
fake test responses and that faking occurs to a 
greater extent at higher educational levels. 

Reliable personnel assessment techniques are 
of vital importance in the modern industrial 
setting. The complex nature of the present-day 
industrial set-up, requiring harmonious social 
relations as well as individual efficiency, necessi- 
tates careful selection of both staff and line per- 
sonnel according to personality qualifications as 
well as technical abilities. The problem of the 
inadequacy of present assessment techniques is 
pointed out by Herzberg. “Psychologists have 
reached an advanced stage in the development 
of test measures which can be applied to the 
industrial situation. The least dependable of 
these measures lies today in the area of person- 
ality assessment. One of the major reasons for 
the wariness with which industrial psychologists 
approach temperament inventories is the trans- 
parency of such tests and their corresponding 
amenability to faking or pointing answers to 
achieve a desired effect” (24, p. 81). 


It is clearly evident from all the 


studies mentioned above that the effec- 
tiveness of direct personality assessment 
techniques depends heavily upon motiva- 
tional and situational conditions. The 
tendency for test subjects to malinger or 
disguise personality weak spots is pres- 
ent in the large majority of test situa- 
tions in both therapeutic and assessment 
work. Honest response seems to be ex- 
ceptional rather than typical. Thus, the 
serious problem in personality measure- 
ment is not a lack of well designed in- 
ventories and questionnaires, but rather 
the need for a method of ensuring honest 
response and valid information. 

The development of projective testing 
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has to a large extent served to provide 
such a method. Many questions concern- 
ing the level of behavior measured by 
projective techniques, scoring difficulties, 
subject-to-subject response variation, and 
other idiosyncrasies indicate the imper- 
fection of projective techniques in their 
present stage of development. Neverthe- 
less, their effectiveness has been well sup- 
ported by experimental and clinical evi- 
dence, and the insusceptibility to dis- 
tortion because of the ambiguity of stim- 
ulus material has made them valuable in- 
struments in personality assessment, es- 
pecially under motivational conditions 
conducive to dishonest response. 


Hutt (28) emphasizes the necessity of employ- 
ing projective measures particularly in thera 
peutic work, “To the extent that the stimulus 
[of a personality measure] produced a behavioral 
reaction of the subject which is at once basic and 
penetrating it is valid as a clinical procedure. 
From this viewpoint, the casual, conventional 
paper and pencil personality questionnaire, while 
it may be an excellent device for other purposes, 
is limited as an inclusive measure of personality 
assayal. It does not reveal temporary or perma- 
nent psychological processes or personality mani- 
festations through which the psychologist may 
gain access into the subject's difficulties and 
insight into his personality structure, his com 
plexes, conflicts, mental content, and the like” 
(p. 134). Meehl (43), although essentially in 
agreement with Hutt, suggests that self-report 
is also valuable in formulating personality ap- 
praisal and recommends that direct and projec- 
tive methods be used in cooperation rather than 
competition. 

Ihe value of projective testing 
methods have been well documented in psycho- 
logical literature (3, 19, 47). Yet surprisingly few 
studies have investigated the comparative effec- 
tiveness and intervalidity of direct and projective 
techniques administered to the same experi 
mental group and designed to measure a specific 
variable or syndrome. Hanna (23) compared clini- 
cal evaluations of subjects who had applied to a 
hygiene clinic for aid in adjustment to college 
with their the Thurstone Personality 
Schedule in an effort to validate the schedule 
as a measure of neurotic tendency, “Comparison 
of Personality Schedule with tentative 
classification as to maladjustment of subjects 
dealt with on the basis of fairly careful clinical 
procedure shows a far better than chance rela- 


use and 


scores on 


scores 


ye 
Gar 
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tionship. This relationship becomes more posi- 
tive as the degree, or extent, of maladjustment 
increases. . Such results would seem to indi- 
cate a rather useful function of the Personality 
Schedule in weeding out individuals . . . [need- 
ful] of psychological attention” (23, pp. 444). 

Laslett and Bennett (35) administered the 
Word Association Test, using the Kent-Rosanoff 
stimulus word list and the Bernreuter Inventory, 
to a group of college students. Although good 
subject-experimenter rapport was reported, there 
was a marked lack of agreement on neurotic 
tendencies between the two measures. The sub- 
jects were suspected of conscious protection of 
sensitive personality areas on the direct instru- 
ment as well as overstatement resulting from 
too great awareness of minor weaknesses. 

In an experiment designed to study the effect 
of phrasing in personality questionnaires on 
honesty of response, Ellis (15) found significant 
differences between responses made to indirectly 
phrased items and those made to directly 
phrased items. Ellis used two experimental 
groups, a group of well adjusted boys and a group 
of problem children, and administered a person- 
ality inventory made up of items gathered from 
various standard personality inventories. Half of 
the items were phrased indirectly and randomly 
mixed with the directly phrased items. The 
scores of the two groups did not differ signifi 
cantly on the direct items, while the problem 
child group showed considerably more malad- 
justment on the indirect items. Within both 
groups a significantly greater number of malad- 
justment responses were given to the indirect 
items. 

Sacks (53) and Hanfmann (22) have studied 
the Sentence Completion Test and compared 
responses to first and third person sentence stems 
Sacks, testing a group of patients in a mental 
hygiene clinic, found significantly greater malad- 
justment indicated in responses to third person 
items and regarded the maladjustment 
scores on first person stems as a measure of 
defensiveness. In her study of sources of refer- 
ence in responses to sentence completion items, 
Hanfmann discovered that a large percentage 
of responses to third person items were deter- 
mined by unconscious self-reference 

Research which directly compares self-report 
and projective responses was conducted by Get- 
zels (20). Employing his “method of paired direct 
projective questionnaires,” Getzels__ per- 
formed two experiments, one dealing with per 
sonality differences between clinically judged 
“well adjusted” and “maladjusted” veterans who 
were applying for financial grants at a Veterans 
Administraion Guidance Center, the other a 
study of prejudice among the members of a psy 
chology class at a large Eastern college for 
women. In the former experiment, two tests were 
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administered. The first was a projective sen- 
tence completion test consisting of 40 third 
person stems. The stems were designed to relate 
to six categories of vocation and personal perti- 
nence and included a number of neurotic symp- 
toms. The second was a direct test consisting of 
20 multiple choice items which matched 20 of 
the sentence completion items. The items were 
first person sentence stems for which were pro- 
vided four possible completions; the subject was 
instructed to select one. The results demonstrated 
to a significant degree that the maladjusted vet- 
erans gave no more maladjusted responses to the 
direct items than did the well adjusted veterans, 
yet on the projective measure the maladjusted 
group showed considerably more maladjustment 
than did the healthy group. The maladjusted 
group also were more inconsistent in their re- 
sponses to matched direct and projective items. 
A direct item such as, “When I 
decision " might be completed “I act 
promptly”; whereas the projective 
“When Fred has to make a decision * might 
elicit the response “he feels unsure.” Since the 
maladjusted veterans were anxious to “pass in- 
spection’ 


must make a 


phrasing, 


‘in order to receive their requested 
grants, Getzels suspected that performance on 
the direct items artificial and indicated 
defensiveness. The projective items, on the other 
hand, were much less transparent and conse- 
quently bypassed defensive barriers and evoked 
honest personality qualities. Getzels concluded 
that, where self-interest was served by con- 
formity to normative expectations, projective 
measures of personality would most effectively 
discriminate between adjusted and maladjusted 
individuals 


was 


The second study reported by Getzels utilized 
the same technique of matched direct and in- 


direct questions. Three classes in psychology at 
a women's college were given a sentence com- 
pletion test composed of 59 third person items 
designed to assess attitudes toward family, poli 
tics, religion, and prejudice and to reveal related 


personality traits. The test was called a ‘verbal 
speed test” and the subjects were instructed to re- 
spond as quickly as possible. Two forms of the 
direct test were administered to the three classes 
under different conditions. Both direct tests con- 
sisted of 40 first person sentence completion 
items which matched 40 of the projective items 
Test A having multiple choice responses and 
Test B being open ended. The first class was 
given the multiple choice form and instructed to 
sign their names since the information was being 
gathered “for the record office.” The second class 
was also given the multiple choice form; how- 
ever, it was explained that the information was 
necessary for use in scientific research and that 
the tests were to be answered anonymously. The 


third class was administered the open ended 
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form of the direct test, and instructions were the 
same as those given to the first class. In all three 
groups the projective scores on prejudice were 
higher than the direct scores. The least dis- 
crepancy between direct and projective prejudice 
scores appeared in the group which had been 
guaranteed anonymity. The distribution of the 
direct prejudice scores tended to approximate a 
J] curve, while the projective scores were normally 
distributed. These results confirmed the hypothe- 
sis that, in responding to socially conflicted ob- 
.jects of inquiry, the overt response or expressed 
reaction will be more in conformity with social 
expectations than with personal sentiments 
Getzels’ findings in both experiments demon- 
strate the necessity among socialized individuals 
of masking the true self in certain social situa 
tions as a vital adaptive function. 

A recent study by Davids (8) has brought into 
sharper focus the problem of comparative per- 
formance on direct and projective personality 
assessment techniques under certain motivational 
conditions. Two experiments were performed, 
both using normal subjects and employing the 
same experimental measures. The variable op- 
erating to distinguish the two experiments was 
motivational condition. The first group of sub- 
jects were male college undergraduates partici- 
pating in an extensive research program studying 
personality. Good experimental rapport had 
been established, and all the subjects were 
guaranteed anonymity. The second group, also 
college undergraduates, underwent the test bat- 
tery under the impression that well adjusted 
performance would qualify an individual for a 
high paying job as research assistant to staff 
members conducting psychological experiments. 
The subjects were told that the test battery 
would determine the selection of several persons 
who seemed well suited for work requiring mini- 
mal interference from personal problems. 

The test battery used in both experiments 
consisted of five assessment techniques: the 
Psycho-Somatic Inventory, a happiness self-rating 
scale, a word association test, a sentence com- 
pletion test, and a clinical evaluation. The sub- 
jects were ranked by a clinical psychologist on 
ego structure. Ego structure or ego strength has 
been defined by Murray and Kluckhohn (33) 
and is considered to be highly correlated with 
good adjustment. The Psycho-Somatic Inventory 
and happiness self-rating scale were used as 
measures of neuroticism and poor adjustment. 
The two projective instruments were structured 
to measure eight personality dispositions which 
together form a symptomatic group called “the 
alienation syndrome.” The syndrome consists of 
three pairs of polar variables: optimism-pessi- 
mism, trust-distrust, and sociocentricity-egocen 
tricity; and two additional negative dispositions: 
anxiety and resentment. The three positive or 
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socially desirable dispositions—optimism, trust, 
and sociocentricity—are symptoms of nonaliena- 
tion and indicate good adjustment. The five 
negative or socially undesirable variables are 
qualities symptomatic of maladjustment and 
because of their high intercorrelation have come 
to be regarded as forming an alienation syn- 
drome (6, 7, 9). 

In accordance with Allport’s assertion that 
personality test performance by the normal, 
healthy adult will be consistent in direct and 
projective measurement, Davids predicted in 
both experiments that (a) neuroticism as meas- 
ured by the Psychosomatic Inventory would cor- 
relate with poor adjustment as measured by 
each of the other direct and projective meth- 
ods of assessment, and (b) there would be 
a significant degree of concordance among rank 
orders on the five measures of poor adjustment. 
The two hypotheses were confirmed in both 
experiments. However, in comparing the actual 
magnitude of the correlations within each group, 
Davids found that the group tested under the 
employment conditions showed less consistency 
from measure to measure and obtained lower 
scores on all measures. The results 
of this study, although in support of the con- 
tention by Allport that direct and projective 
personality assessments of normals will be con- 


alienation 


gruent, demonstrate that performance on per- 
sonality measures is a function of many variables, 
one of the most important of which is motiva- 
tional conditions. 


General Statement of the Problem, Aims, 
and Approach of the Present Investiga- 
tion 


The purpose of this study is to further 
investigate the relative performances on 
direct and projective personality assess- 
ment measures by subjects under differ- 
ent motivational conditions. Two groups 
of subjects were obtained, and a battery 
of nine personality assessment tests was 
assembled, including four projective 
measures and five direct instruments. In 
addition, all the subjects in both groups 
were interviewed by a clinical psycholo- 
gist who evaluated the subject accord- 
ing to certain personality criteria. The 
experimental group of subjects consisted 
of 23 male college undergraduates con- 
tacted through the college employment 
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ofice. Upon inquiry by the office direc- 
tor, each subject expressed an interest in 
a proffered position as research assistant 
at a psychological clinic. Acting as a 
control group were 20 undergraduates 
participating in an extensive personality 
testing program. All had volunteered 
their services and were paid at student 
employment The test battery, 
structured to measure directly pro- 


rates. 


jectively a definite personality constella- 
tion, was administered to the experi- 
mental group as a personnel selection 
screening process, ostensibly as a means 
of selecting a well adjusted personality. 
The control subjects took the same tests 
as part of the larger testing program in 
which good tester-testee rapport was 
carefully established and all information 
kept strictly confidential. It was expected 
that, as a result of the dissimilarity of 
testing conditions, the two groups would 
perform differently on the test battery. 
The experimental design was based 
largely upon that used by Davids in the 
study described above, with a similar 
focus on the assessment of alienation and 
related symptoms of maladjustment un- 
der different conditions of motivation. 
However, since the Davids study con- 
tained some procedural weaknesses and 
posed several problems for future re- 
search, an effort was made in the present 
study to improve and extend the design 
in an attempt to secure more conclusive 
findings. —I'wo major revisions were of 
primary importance: the first relating to 
the selection of subjects, the second per- 
taining to the assembly of the test bat- 
tery. The subjects in both of Davids’ 
groups were very closely matched. All 
were selected from a larger group of stu- 


dents who had volunteered to take psy- 


chological tests, and most were psychol- 
ogy or social relations majors. The group 


tested after being informed of the availa- 
ble job were not seriously in need of 
employment, and success in getting the 
position was not as critical as it is in 
most real employment instances. In the 
present study the subjects tested in the 
employment situation were not psychol- 
ogy majors, did not volunteer to take 
psychological tests, and were registered 
with a college employment office as need- 
ful of employment. Thus an artificial, 
yet thoroughly convincing, employment 
situation was created, and experimental 
“future employees” little 


psychological sophistication were tested. 


having very 


The second major extension of the 
Davids study was the assembly of a uni- 
formly structured test battery which in- 
cluded both direct and projective tech- 
niques, specifically designed to measure 
the alienation syndrome. Neither of the 
Davids 
were measures of alienation; both were 


two direct instruments used by 
positively correlated with the alienation 
syndrome. Also, the clinical evaluation 
ratings used by Davids were on ego 

rather Al- 
weak ego been 


found to be highly correlated with alien- 


structure than alienation. 


though structure has 
ation (7), for the sake of consistency a 
clinical rating on alienation should have 
been used. In the present study this re- 
finement has been made. Finally, since 
lacked either of 
most populat projective instruments cur- 
the Rorschach the 
Thematic Apperception ‘Test—the exper- 
imenters included the TAT in the pres- 
ent battery. 


Davids’ study the two 


rently in’ use and 


Thus, fundamental to the design of 
this research are three elements: (a) a 
control group under optimum motiva- 
tion to respond honestly to personality 
assessment measures; (b) an experimen- 
tal group, interested in gaining employ- 
ment, undergoing a_ personnel assess- 


ment procedure; and (c) a test battery 
comprised of direct and projective tech- 
niques uniformly structured and de- 
signed to measure the alienation §syn- 
drome. This approach is considered to be 
an effective means of studying the prob- 
lems of direct-projective test consistency 
among normals and the influence of 
strong motivation on responses to per- 
sonality assessment measures. 


Hy potheses 


1. Within a group of subjects moti- 
vated to respond honestly to personality 
assessment measures, test performances 
will be consistent on both direct and 
projective measures. 

a. There will be high correlation be- 
tween rank orders on all measures, direct 
and projective. 

b. There will be a significant degree 
of concordance among rank orders on 
direct measures, On projective Measures, 
and on direct and projective measures 
together. 

2. Within a group of subjects moti- 
vated to distort personality assessment 
measures in order to appear well ad- 
justed, test performances will be incon- 
sistent on direct and projective measures. 

a. There will be a low correlation be- 
tween rank orders on projective measures 
and on direct and projective measures 
together. There will be higher correla 
tion between direct measures. 

b. There will be an insignificant de 
gree of concordance among rank orders 
on direct measures, On projective meas 
ures, and on direct and projective meas- 
ures together. The degree of concordance 
will be greatest among direct measures. 

4. Within a group of subjects respond- 
ing honestly to personality assessment 
measures, neuroticism and anxiety will 
be positively correlated with alienation. 
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4- A group of subjects motivated to 
distort personality assessment measures 
will show less alienation and maladjust- 
ment on direct measures than a group 
motivated to respond honestly; on pro- 
jective measures, both groups will re- 
veal the same degree of alienation and 
maladjustment. 

a. The mean alienation scores of the 
distorting group on the direct measures 
will be significantly lower than those of 
the honest group. 

b. The individual alienation scores of 
the distorting group on the direct meas- 
ures will be less heterogeneous than 
those of the honest group, and will tend 
to cluster toward low alienation and 
good adjustment. 

c. The mean alienation scores of both 
the distorting group and the honest 
group will be the same on the projective 
measures, 

d. The individual alienation scores of 
both groups will be equally heterogene- 
ous on the projective measures, and will 
be similarly distributed throughout the 


alienation and maladjustment ranges. 


Pur EXPERIMENT 
Experimental Treatment 

A primary objective in the experimen- 
tal design was to create a real employ- 
ment situation which, in contrast to an 
artificial laboratory experiment, would 
stimulate serious incentive. This objec- 
tive was accomplished by providing the 
two elements necessary for such a condi- 
tion: a job and a group of men seeking 
employment. Only the experimenters 
knew that the job was fictitious. The 
subjects were contacted through the col- 
lege employment office, the actual selec- 
tion being made by an employment ofh- 
cial unaware of the purposes of the ex- 
perimenters. The subjects were simply 


e 
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informed of several job openings as re- 
search assistants at the Harvard Psycho- 
logical Clinic and instructed to report 
for an interview. Each job applicant was 
briefed as to the nature of the work and 
told that final selection of applicants 
would be based on the results of a screen- 
ing procedure involving a battery of 
tests. The interviewer indicated that the 
pay rate for research assistants would be 
considerably above that usually offered 
by college employers. 

The position described to the prospec- 
tive employees was that of observer and 
recorder of small group interaction ses- 
sions. They were told that a program of 
research with relations be- 
tween personality attributes of individ- 
ual members and efficiency of group 
functioning was being planned and as- 
sistants were needed to work behind a 
one-way vision screen recording such 
things as aggression and withdrawal by 
the group members, critical or construc- 
tive comments, and group functioning as 
a whole. When explaining the nature of 
the research assistant position during the 


concerned 


interview, the experimenter implied that 
a “mature, well adjusted’’ personality 
was requisite in order that observations 
of behavior might be free of bias stem- 
ming from personal problems. At all 
times during the testing procedure the 
attitudes of the experimenters toward 
the subjects were those of personnel se- 
lection men toward prospective employ- 
ees. No attempt was made to gain close 
rapport as in clinical testing, nor were 
the subjects antagonized by any super- 
ciliousness or discourtesy. At the end of 
the testing process, the subjects were 
paid for their time at the prevailing stu- 
dent employment rate and told that they 
would be contacted within 
and 


two weeks 
informed of their selection or re- 
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jection. personal letter subse- 
quently mailed to each subject explain- 
ing that an anticipated research grant 
had failed to come through, conse- 
quently making it impossible to hire any 
additional research assistants. 

Thus, in spite of the unavoidable ne- 
cessity of using college undergraduates as 
subjects and of carrying out the testing 
procedure in a psychological clinic, an 
employment situation was created which 
was in no way fictitious or hypothetical 
in the minds of the subjects. The sub- 
jects needed work, a good paying job 
was offered, and a personnel selection 
process was administered supposedly for 
the purpose of “mature, 
suited 


obtaining a 


well adjusted” personality for 


work in psychology. 


Subjects 
The Control Group 


The control group consisted of 20 male under- 
graduates who had volunteered to take part in 
an extensive study of personality under the di- 
rection of Henry A. Murray at the Harvard 
Psychological Clinic. In response to a question- 
naire distributed in a large undergraduate course 
in Social Relations, these men, almost all ma- 
joring in Social Relations or Psychology, ex- 
pressed a desire to participate in psychological 
experimentation. The group included sopho- 
mores and juniors of varied social and economic 
backgrounds, and having diverse interests, abili- 
ties, and personalities. The subjects were paid 
at current student employment rates during the 
experimenting which continued for two years 
and included a comprehensive range of standard 
and experimental assessment measures. At the 
outset of the experimentation the subjects were 
guaranteed anonymity and encouraged to be 
honest and uninhibited in responding to the 
various test measures. They were assured that in 
this way they could make the effective 
contribution to science. Good rapport was estab 
lished between the staff of investigators and the 
subjects which further inspired trust and candid- 
ness. 


most 


The Experimental Group 


Iwenty-three male undergraduates were ob- 
tained through the Harvard Employment Office 
The office was requested to contact approximately 
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20 men registered with it as desirous of employ 
ment who might be interested in a job as re- 
search assistant at the Harvard Psychological 
Clinic. The Emplyoment Office official was in 
structed that students who were not majoring in 
Social Relations or Psychology were preferred for 
the position. The rationale for this preference, 
which was communicated to both the Employ 
ment Office and the candidates, was that the 
subjects to be used in the forthcoming small 
group research would be drawn from courses in 
personality psychology and that there would be 
less likelihood of the research assistants being 
acquainted with the people they would be ob- 
serving if they themselves were not majoring in 
the same field. An additional reason given for 
the preference was that untrained 
having a small knowledge of psychology might 
tend to project a certain amount of inappropriate 
psychological sophistication into observations of 
behavior. Consequently, in this circumstance, the 
tabula rasa individual with regard to psycho 
logical knowledge would be better suited than an 
individual having an elementary understanding 
of psychology. 


observers 


A total of 23 men eventually reported to the 
Clinic for the initial interview. All expressed 
considerable interest in the type of work being 
offered and remained in competition for the 
positions. This group, like the control group, 
contained men with varied personalities and 
interests. Both groups, then, consisted of male 
undergraduates and had comparable intragroup 
diversities. However, the experimental group 
differed from the control group in three im- 
portant aspects: They were not Social Relations 
or Psychology majors, they did not volunteer to 
take psychological tests, and they were seriously 
interested in obtaining employment. 


Measures 


‘Ten measures of personality were em- 
ployed in this study. Nine of the measures 
involved written or verbal responses to 
objective and projective tests. The tenth, 
a clinical evaluation by an experienced 
psychologist, was based on the initial in- 
terview, the subject unaware that his in- 
terviewer was~ collecting information 
from which an evaluation and alienation 
ranking could be made. Five of the tests 
were questionnaires and self-rating scales 
which directly measured alienation or re- 
lated variables such as anxiety and hap- 


piness. The remaining four were projec- 
tive instruments designed particularly to 
tap the alienation syndrome. Of these in- 
struments, to be briefly described below, 
the following were designed by Davids 
(6) and are not yet published: the Self- 
Rating Scale, the Affect Questionnaire, 
the Word Association Test (stimulus 
word list), the Sentence Completion Test 
(sentence stems), and the Azzageddi Test. 
Detailed descriptions of the development 
of these measures and copies of the test 
materials and instructions are presented 
in Davids’ doctoral dissertation (6). The 
utility and validity of these instruments 
have been demonstrated in several recent 
studies (6, 7, 8, 9, 11). In the present 
study, the battery of assessment proce- 
dures was specially selected for the 
experimental group and was adminis- 
tered according to a_ predetermined 
schedule of experimental sessions. The 
control group took the same tests along 
with many additional measures as part 
of the intensive personality research pro- 
gram in which they were participating, 
and thus the order of administration dif- 
fered in the two groups. 


The Clinical Evaluation 


Both groups were interviewed by a clinical 
psychologist before the commencement of the 
testing program. The interview lasted about 25 
minutes and mainly fact finding. In the 
case of the control subjects, the apparent pur- 
pose was to interest the person in the research 
project and acquaint him with some of the aims 
of the study. The experimental subjects sup- 
posedly were being interviewed for employment, 
the interviewer acting as a prospective employer 
rather than a reseach director. In both groups 


was 


the interviewer noted carefully personality varia- 
bles such as cooperativeness, friendliness, aggres- 
sion, neuroticism, ahxiety, etc. Attitudes and 
values, as well as past experiences, were elicited 
during the course of the interview by means of 
seemingly casual questions and pertinent con 
versation. At the end of each interview, the 
psychologist reviewed the material gathered dur- 
ing the meeting and rated the subject on the 
variables in the alienation syndrome. On the 


= 


basis of information and impression derived 
from the interviews, the subjects within each 
group were ranked on alienation? 


The Direct Instruments 


The Self-Rating Scale. This scale (6) asks the 
subject to rate himself on the eight personality 
dispositions contained in the alienation syn- 
drome. Each disposition is subdivided into four 
areas in which the trait might be manifested. 
Each subdivision is defined, and examples are 
provided in the instructions. A summary scale 
follows each set of subdivisions and asks the 
subject to make a summary or over-all self-rating 
on the particular trait. The entire questionnaire, 
then, contains 40 scales on which the subject 
rates himself, the eight dispositions each having 
four scales dealing with subareas, and one sum- 
mary rating on the individual trait as a whole. 
For example, the disposition “pessimism” is di- 
vided into: world situation, family relations, 
social relations, and personal success, Following 
these is a scale measuring general pessimism 
All 40 ratings are made on six-point scales run- 
ning from “much less than most people” to 
“much more than most people.” This system 
thus allows a maximum score of 240 and a mini- 
mum of 40. 

The Happiness Scale. This scale, devised by 
Goldings (21), is also a self-rating measure. The 
subject rates his usual level of happiness, com- 
pared with others of same sex and age, in eight 
areas of life activities. He then indicates the 
relative importance of these areas in determining 
his general level of happiness. The areas include 
family relations, intellectual work, personal 
friendships, health, sex relations, etc. Finally, the 
subject rates himself on general happiness. The 
ratings are again on six-point scales ranging 
from “much less happy than most” to “much 
more happy than most.” The scoring system, 
described in detail by Goldings, results in a sum 
mary score which can range from one to six. 

The Affect Questionnaire. This measure con- 
sists of 80 statements designed to relate to the 
alienation variables, 10 statements being devoted 
to each disposition. The subject is asked to indi- 
cate his feeling about each statement at a point 
on a six-point scale ranging from “strongly dis- 


*'The control subjects were actually inter- 
viewed twice, the first meeting of more brief 
duration for the purpose of selecting the final 20 
subjects to be used in the research. Each subject 
also submitted an autobiography containing past 
history and experiences which was read by the 
interviewer. The ranking of the control subjects 
was thus based on information gathered from the 
two interviews and the autobiographies. 
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agree” to “strongly agree.” Sample statements 
are: “the world is a bright and cheery place,” 
“very few people can be trusted,” “we are sur 
rounded by all sorts of dangers,” “to put some 
people in their place it is necessary to use 
physical force or threats of force.” The state- 
ments are arranged in a random, but systematic 
order. Scoring is on a six-point basis, complete 
agreement receiving a score of six and complete 
disagreement receiving a score of one. A more 
detailed description of the instrument and the 
scoring method is given by Davids (6, 7). 

The Psycho-Somatic Inventory. This inventory 
presents a list of g2 symptoms of physical and 
mental disturbances; the subject indicates how 
often he has experienced these symptoms. Each 
symptom is followed by four possible responses; 
“often,” “at times,” “seldom,” “never.” The sub- 
ject is asked to check the response which best 
describes him. Sample items are “feel well and 
happy?” “jumpy or 
easily startled?” “feelings of guilt or sinfulness?” 
“headache, pressure of pains in head?” The total 
score on this inventory is regarded as an index 
of neuroticism, with high positive scores charac- 
terizing normals and high negative scores charac 
terizing neurotics. A complete description of the 
development, scoring and standardization of the 
inventory has been published by McFarland and 
Seitz (41). The utility of this measure in the OSS 
assessment program is reported in Assessment of 
Men (50). 

The Taylor Manifest Anxiety Scale. Like the 
Psycho-Somatic Inventory, this questionnaire con- 
tains a list of signs of physiological and _ psy- 
chological maladjustments. It is designed to focus 
on manifest anxiety, however, rather than the 
general neurotic syndrome. Fifty symptomatic 
statements are to be checked as true or false by 
the subject according to whether or not each 
applies to him. Examples are: “I frequently find 


“troubled with shyness?” 


myself worrying over something,” “I am easily 
embarrassed,” “I am troubled by attacks of 
nausea,” “I work under a great deal of tension,” 


“I cry easily.” Each anxiety symptom indicated 
as “true” by the subject receives one point; thus 
an individual score could range from © to 50. 
Taylor has presented a detailed description of 
the development of this instrument (59, 60), and 
several recent studies have employed this meas- 
ure of anxiety in the selection of experimental 
subjects (4, 10, 12, 45). Validity studies, relating 
this scale to other measures of anxiety, have 
yielded varying results (5, 16, 25, 26, 31). 


The Projective Instruments 


The Word Association Test. The word asso- 
ciation method is a well known and widely used 
technique. Its effectiveness both in clinical diag- 
nosis and experimental investigation has been 
demonstrated, and many lists of stimulus words 


, 
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are available (2, 3). However, since a major focus 

in the present research was the investigation of 
the alienation syndrome in personality, it seemed 
appropriate to design a word list which would 
relate to the variables under study. Accordingly, 
a list of 100 nouns was devised (6) consisting of 
20 neutral stimulus words and 10 words designed 
to associate with each of the eight dispositions 
in the alienation syndrome. In order to facilitate 
group testing and ensure constant testing con- 
ditions from group to group, the stimulus words 
were electrically recorded on tape using a high 
fidelity Magnecorder, Type PT6-AH. A 20-second 
interval follows each word on the recording, 
during which the subject writes his chained 
associations on a response sheet. In this study, 
every response to each of the stimulus words 
was scored—a procedure that involved the scoring 
of nearly 20,000 response words. Every response 
word in each test protocol was analyzed and 
classified into one of nine possible categories (the 
eight dispositions plus a miscellaneous category). 
The nine categories were then reduced to three: 
the number of associations scored under the five 
negative dispositions (alienation), the number of 
associations scored under the three positive dis- 
positions (nonalienation), and the total number 
of miscellaneous responses. A final alienation 
score was derived by dividing the total number 
of alienation responses by the number of words 
scored under all eight categories (excluding mis- 
cellaneous responses). In this way each subject 
received a percentage score on alienation. 

The Sentence Completion Test. Like the Word 
Association Test, the sentence completion method 
is a firmly established technique in both experi- 
mental and diagnostic psychological work (2, 51, 
58). This method is also amenable to specific 
design so as to enable the investigator to meas- 
ure particular attributes. Thus, 100-item 
Sentence Completion Test was designed (6) which 
contained a few stems taken from available tests 
but mainly stems written especially for the pres 
ent investigation which would secure data per- 
tinent to the alienation variables. Of the 
items 20 were designed to be neutral, and 10 
stems were designed to relate to each of the 8 
variables. Half of the stems were in the first per 
son, the other half in the third person. The 
various stems were distributed in random fashion 
throughout the test. Standard procedure for 
group administration was followed, each subject 
receiving a mimeographed copy of the test and 
instructions which emphasized speed. The 100 
responses were scored under the nine scoring 
categories mentioned in the above discussion of 
the scoring of the Word Association Test. A score 
was computed for each category, and then the 
five negative disposition scores were summed to 
provide a total alienation score. In addition, 
separate alienation scores were computed for 
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first and third person stem responses. As a result, 
each subject received three alienation scores based 
on number of alienation completions in re- 
sponse to: first person stems, third person stems, 
and the entire 100 stems. 

The Azzageddi Test. This unique instrument, 
originated by Davids and Murray (6, 11), is an 
auditory projective technique designed to meas- 
ure the alienation syndrome. The stimulus 
material for the test consists of a series of eight 
spoken paragraphs constructed by intermingling 
sentences and phrases expressive of the disposi- 
tions in the alienation syndrome. Each of the 
paragraphs focuses on one of the eight disposi 
tions, but also contains material representative 
of some of the other dispositions. More spe- 
cifically, each paragraph contains: four state- 
ments representative of a single disposition, four 
statements each associated with a different dis- 
position, and six short phrases each associated 
with a different disposition. Half of the material 
in each paragraph is phrased in extravagant, 
figurative language and includes both evaluative 
interpretations of the world and exhortations 
stating remedies for certain avowed conditions. 
Ihe intermingling of inconsistent ideas com- 
bined with the bombasity of the language causes 
each paragraph to sound confused and ambig- 
uous to the listener. The paragraphs were elec- 
trically recorded on tape using a Magnecorder, 
Type PT6-AH. Fach paragraph is approximately 
of one-minute duration and is followed by a 
three-minute interval of silence during which the 
subject writes material recalled from the para- 
graph. The subject is instructed to listen to each 
paragraph and then write on the provided an 
swer sheet: all the ideas he can remember from 
the paragraph, what he believes to be the major 
idea in each passage, and a few adjectives describ- 
ing the mood of the speaker. In order to moti 
vate performance on the test, it was referred to 
as a “test of intuitive intelligence.” In the scor- 
ing system employed in this study, all the re- 
sponses relevant to the five negative dispositions 
were summed in each test protocol. Each subject 
thus received an alienation score based on 
amount of alienated material recalled. 

The Thematic Apperception Test. Since its 
inception by Murray in 1935, the TAT (48) has 
become an exceedingly popular clinical instru- 
ment. The nature of the test and its value as a 
method of revealing dominant drives, sentiments, 
and conflicts in a personality are well known (2, 
47). In the present study, ten cards were selected 
and administered according to the revised TAT 
instructions recommended by Murray (49), the 
two important alterations in administration pro- 
cedure being the removal of the card from the 
sight of the subject after 20 seconds of observa 
tion and the request that the subject choose a 
proper name for the chief character before pro- 
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ceeding with the story. The cards used and the 
order of presentation were: 1, 3BM, 4, 5, 6BM, 
7BM, 11, 14, 15, 18BM (48). A scoring scheme 
devised by the experimenters, based on the for- 
mulations of Murray (48) and Stein (57), was de- 
signed to measure the alienation syndrome. The 
stories given by each subject were recorded on a 
Dictaphone and later transcribed verbatim. Each 
story was read, and signs of the eight alienation 
dispositions were noted. Attention was focused on 
five components of story content: (a) the behavior 
of the hero, his needs, feelings, motives, etc.; ()) 
the environmental situation (press), forces, stim- 
uli, pressures, etc.; (c) themas or inner states, 
feelings and emotions expressing the general 
tenor of the story, structural motifs; (d) the out- 
come of the story; and (e) the language used, 
phrasing, word usage, sentence structure, etc. 
Each story received a score ranging from o to g on 
all eight dispositions. If no traces of a disposi- 
tion were present in the story, a score of 0 was 
assigned, If slight indications could be noted, the 
particular disposition received a score of 1. 
When a disposition was distinctly present in one 
of the five main components described above, it 
was scored 2. When disposition was manifested in 
more than one of the components, it received a 
maximum score of g. In this way, both frequency 
and intensity of scorable material were taken into 
account. Each disposition was considered sepa- 
rately, and no canceling out procedure was used 
in handling the polar dispositions. Thus a story 
could achieve a maximum score of 24 and a 
minimum of o, and a total protocol score could 
range from 240 to o. A final alienation score for 
each subject was derived by subtracting the total 
number of points scored on the three positive 
dispositions from the total points received on 
the five negative dispositions, 


Statistical Treatment of the Data 


The following statistical techniques 
were used in analyzing the data gathered 
in this study. Spearman's rank-order cor- 
relation coefhcient (rho) was employed to 
indicate association 


between different 
measures. To test for the degree of agree- 
ment among several different rankings of 
the same group of subjects, the coefficient 
of concordance (W) was employed. ‘The 
7? test was used to test significance of 
W. These statistical techniques (46) were 
used to measure relationships within 
each group The ¢ test and the F test 
were employed for testing the signifi- 
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cance of differences between means and 
variances in the two groups. 

Reliability of scoring of the projective 
instruments was expressed in terms of 
percentage agreement between independ- 
ent scorers. 

A one-tailed test of significance was 
used in all instances where the direction 
differences between 
groups was predicted in advance of sta- 


of association or 


tistical analysis. The .o5, level of signifi- 
cance was set as the level required to re- 
ject the null hypothesis. 


Reliability of Scoring of Proyective 
Measures 


Since the scoring of the projective 
judg- 


ments by the scorer, it seemed appropri- 


techniques involved subjective 
ate to examine the reliability of the 
scoring procedures. Projective protocols 
were scored by an independent rater 
and, with the exception of the TAT, the 
percentage agreement between the two 
raters was used to indicate degree of re- 
liability. Because of the nature of the 
TAT response material and the scoring 
system used, Spearman's rank-order cor- 
relation was computed as a measure of 
reliability. 

For the Word Association Test, the en- 
tire protocols were independently scored, 
with each response being classified as 
positive (nonalienation), negative (alien- 
Percentage 
agreement these 
three categories. For both the Sentence 
Completion Test the 
Pest, one-quarter of each subject's proto- 
col was scored and percentage agreement 
computed for the same three scoring 
categories. The TAT protocols were 
scored by a second rater, and a rank- 


ation), or miscellaneous. 


was measured using 


and Azzageddi 


order correlation was computed between 
the alienation ranks assigned on the 
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TABLE 1 


INTERRATER AGREEMENT OF RELIABILITY SCoR- 
ING OF Four PRojectivE INSTRUMENTS 


Experi- 
Control 
Measures group 
; group 
(N= 20) 
(N = 23) 
Word Association Test 
Sentence Completion Test .83 
Azzageddi Test .96 
Thematic Apperception Test .87 75 


Note.—All figures represent percentage agree- 
ment, except the figures for the TAT, which are 
rank-order correlation coefficients. 


basis of the two independent scorings. 
The results of these reliability checks, as 
shown in Table 1, indicate a highly satis- 
factory degree of interrater reliability on 
each of the four projective measures. 


Administration of Measures 


The test battery was administered to 
the experimental group in four separate 
sessions. As was mentioned earlier, the 
control group did not receive the tests 
in the same order, but the instruments 
and testing conditions were the same for 
both groups. The following is the sched 
ule and approximate duration of the 
experimental sessions: 


1. The clinical evaluation (interview): 25 min- 
utes. 


2. The Azzageddi Test and Psycho-Somatic In 
ventory: one hour 


Measures Range of scores 


\ffect Questionnaire 106-239 
Self-Rating Scale 35-125 
Happiness Scale 24 
lavlor Scale 10-41 
Psycho-Somatie Inventory 151-353 
Word Association Test $8.81-80.74 
Sentence Completion Test 11-42 
\zzageddi Test 25-57 
rAT 1-49 
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(N = 20) 
Maximum possible 
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3. The Word Association Test and the Sen- 
tence Completion Test: one hour. 

j. The TAT, the Happiness Scale, the Affect 
Questionnaire, the Self-Rating Scale, and the 
Taylor Anxiety Scale: two hours. 


RESULTS 


In this chapter the experimental find- 
ings will be reported and discussed 
briefly. The findings within both the 
control and experimental groups will be 
presented and comparisons be- 
tween the two groups. Several of the 
projective instruments will be discussed 
in some detail. General discussion of the 
results and their implications will be 
reserved for the following section. 


Findings within the Control Group 


The direct and 
projective instruments are presented in 


scores on the nine 
Table 2. All of these represent the total 
alienation those for the 
Happiness Scale, the Taylor Scale, and 
the Psycho-Somatic Inventory. The wide 


scores except 


range of scores obtained on each measure 
and the variances indicate 
diversity of personalities within the con- 
trol group and suggest that the subjects 
were giving a fairly accurate account ol 
themselves and not attempting to ad- 


the size of 


normative 
standard. These results support the pre- 


here to some “acceptable” 


diction that, under the particular moti- 
vational conditions set up (anonymity, 


TABLE 2 
CONTROL Group SCORES ON NINE DIRECT AND PROJECTIVI 


INSTRUMENTS 


\ ariance 


Mean score 


range 


300 170.55 1,550.94 
25-150 75.60 407.52 
3.08 03 

20.80 63.00 

340-375 137.05 25,730.87 
o-100 61.90 58.09 

o-50 24.30 55-01 

o-70 36.05 64.80 


| | 
—Qo-150 22.600 163.904 
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TABLE 
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INTERCORRELATIONS (RHO) AMONG TEN MEASURES OF ALIENATION 


Clinical . P-S Self- 
Me: Faylor 
Measures evalua- Inven- rating 
scaie 
tion tory scale 


Taylor 

P-S 73 .76 

Self-rating .68 62 65 

Affect 59 47 
Happiness 47 52 2 1¢ 
52 78 66 52 
Azzageddi 35 50 61 26 
18 59 30 28 
WAT 75 81 74 71 


AND MALADJUSTMENT 


Control Group (N = 20 


Affect Happi- \ 
question- ness TAT 
naire scale — 


46 
07 

50 5 58 
42 41 
Os 


55 74 7 


Note.—For a one-tailed test, coefficients of .38 and .52 are significant at approximately the .o5 and 


.o1 levels. 


contribution to science, good experimen- 
tal rapport, etc.), the control subjects fon 
the most part would respond honestly 
and hence demonstrate personality di- 
versity within the group. 

The entire ten measures were inter- 
correlated using Spearman’s rho. The 
subjects were rank ordered on each meas- 
ure according to total alienation score, 
with the highest score ranked first. On 
the Happiness Scale, the Taylor Scale, 
and the P-S Inventory, lowest scores were 
ranked first so that the subjects were 
ranked according to unhappiness, mani- 
fest anxiety, and neuroticism. The inter 
correlation matrix is presented in Table 
3. Of the 45 coefficients of correlation, 
only 6 fail to attain significance at the 
.o5 level. Twenty-eight coefficients are 
significant at the .o1 level. Five of the 6 
non-significant coefhicients were between 
projective instruments and direct meas- 
ures. This result can be partially attrib- 
uted to the imprecision of projective test 
scoring methods and to the necessarily 
ambiguous material elicited by projective 
measures. This consistency of test per 
formance demonstrates both honesty of 


responses and reliability of the various 
measures. The high correlations between 


the Taylor Scale and the P-S Inventory 
and the measures focusing on alienation 
indicate the relationship of neuroticism 
and anxiety to the alienation syndrome. 
The prediction of honesty and consist- 
ency of test performance is further con 
firmed by the accuracy of the clinical 
ranking on alienation, This ranking, 
made prior to the testing program, cor- 
related significantly with all but two of 
the test measures. 

The coefhcient of concordance and 
average rho were computed for three 
groups of measures. Group 1 contained 
four direct measures: the P-S Inventory, 
the Self-Rating Scale, the Affect Question- 
naire, and the Happiness Scale. Group 2 
contained the four projective instru- 
ments. Group 3 included the clinical 
evaluation, two direct measures, and two 
projective measures: the  Self-Rating 
Scale, the Affect Questionnaire, the Sen- 
tence Completion Test, and the Word 
Association Test. The results are shown 
in Table 4. The three highly significant 
coefhcients confirm the hypothesis of con- 
sistency within the control group on both 
direct and projective measures. Thus, in 
this group the direct and projective test 
performances were “all of a piece,” both 
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TABLE 4 


COEFFICIENT OF CONCORDANCE (W) AND 
AVERAGE RHO FOR THREE Groups 
or ASSESSMENT MEASURES 


Control Group (N = 20) 
Coeffi- 


Aver 
cient of 2 of 
Measures concord- signifi- 
ance cance 

Psycho-Somatic Inventory 60 45.83 oo! 47 
Self-Rating Scale 
Affect Questionnaire 
Happiness Scale 
Thematic Apperception Test . 69 52.12 01 38 
Sentence Completion Test 
Word Association Test 
Azzageddi Test 
Clinical Evaluation 71 66.96 .63 


Self-Rating Scale 

Affect Questionnaire 
Sentence Completion Test 
Word Association Test 


methods of assessment yielding essentially 
the same picture of the subjects’ relative 
standing on alienation. 


Findings Within the Experimental 
Group. 

Procedures and criteria for scores and 
rank orders were the same as described 
for the control group. Table 5 contains 
the scores on the nine direct and pro- 
jective instruments. The score ranges 
obtained were relatively small, particu- 
larly on the direct measures. The tend- 
ency of the cluster toward 
nonalienation suggests distortion by the 
subjects in an attempt to present a “nor- 


scores to 
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TABLE 5 
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mal” or “acceptable” personality profile. 

The intercorrelation matrix, presented 
in Table 6, reveals 33 nonsignificant 
coefhcients out of the total 46 intercor- 
relations. Of the 12 significant figures, 
only 5 are significant at the .o1 level. 
Nine of the significant correlations are 
among the measures. These re- 
sults clearly demonstrate lack of consist- 
ency within the experimental group, and 


clirect 


the erratic test performances imply con- 
scious distortion by the subjects. The in- 
attributed to two 
factors: (a) the subjects did not know 
that they were being tested for alienation 

they knew, however, that the goal of 


consistency may be 


the testing program was to secure several 
“mature, well adjusted” personalities; 
and (b) the complexity of the alienation 
syndrome prevented any “pegging” of the 
“right answers.” Thus, as a result of the 
uncertainty and diffuseness of the ma- 
terial, distortion attempts resulted in 
inconsistent test performances. 

Two of the three coefficients of con- 
cordance computed for the various test 
groupings are significant as seen in Table 
7. Performances on the projective test 
were sufficiently inconsistent so as to 
prevent significant concordance. Both the 
coefficient of concordance and the average 
rho were significant in the group of di- 


EXPERIMENTAL Group ScorRES ON NINE DtRECT AND PROJECTIVE INSTRUMENTS 


(N= 


Measure Range of scores 

Affect Questionnaire 67-164 
Self-Rating Scale 39-96 
Happiness Scale 2.79-6.00 
Tavlor Scale 1-209 

P-S Inventory 37-304 
Word Association Test 40 .61-78.05 
Sentence Completion Test 5-35 
Azzageddi Test 21-42 


TAT - 5-39 


23) 

Maximum possible Mean score Variance 

range 
5O-300 135.04 430.04 
25-150 65.30 204.04 
1-6 05 57 
o-50 11.87 47.03 
340-378 56. 26 8,562.30 
O-100 3.74 39.72 
22.09 44.08 
o-70 32.78 36.27 
Qo-150 10.74 171.84 
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INTERCORRELATIONS (RHO 


Clinical P-S Self- 
Mess laylor ‘ : 
Measures’ evalua- inven- rating 
. scaie 
tion tory 


scale 


Self-rating 


Affect .02 28 .59 
Happiness 51 43 34 o1 
SCE 13 17 09 19 
Azzageddi — .13 10 05 
rAT 23 07 31 20 
WAT . 36 33 47 23 


Note.- 


.o1 levels. 


For a one-tailed test, coefficients of .36 


rect measures. Yet, in the group contain 
ing both direct and projective measures, 
the average rho fell far short of signifi 
cance. These findings, although not fully 
supporting the prediction of inconsist- 
ency, do not provide sufficient contra- 
diction, 


The direct measures included in the 


first concordance group were all quite 


susceptible to distortion, and the aims of 


each test were sufficiently obvious to 


allow consistent distortion. Two of the 
instruments, the Happiness Scale and the 
Psycho-Somatic Inventory, were not meas- 


ures of alienation and hence were even 
more transparent. Furthermore, concord- 


ance among the five direct and projective 
measures, although significant, is not of 
sufficient magnitude to bring the average 


rho to the required significance level. 


Comparison of the Two Groups 


In order to test the hypothesis of dis 
tortion in the experimental group, it was 


necesary to compare the performances of 
the two groups on the various assess- 
ment measures. According to the predic- 
tions, the experimental group should re- 
veal less alienation and maladjustment on 


the direct measures, but should not differ 


PERSONALITY ASSESSMENT 


FABLE 6 


AMONG TEN MEASURES OF 
AND MALADJUSTMENT 


Experimental Group (N = 23) 


AND MOTIVATION 


ALIENATION 


Affect Happi- 
question- ness idi TAT 
naire scale 


.40 

28 

06 27 19 

17 14 .18 

13 .31 .10 -47 


and .49 are significant at approximately the .o5 and 


significantly on the projective measures. 
The results shown in Table 8 confirm 
the predictions. Using one-tailed ¢ tests, 
the differences between means on the five 
direct measures are in the predicted di- 
rection and significant beyond the .o5 
level, three being significant at the .o1 
level. Three of the variances differ sig- 
nificantly beyond the .o5 level, two of 
these at the .o1 level, and again are in 
the predicted direction. Predicting no 
difference on the projective measures and 
hence using a two-tailed ¢ test, no sig- 
nificant difference is found between the 


TABLE 7 
COEFFICIENT OF CONCORDANCE (W) AND 
AVERAGE RHO FOR THREE GROUPS OF 
ASSESSMENT MEASURES 


Experimental Group (N = 23) 
Coefh 


Aver- 
Measures xe age 
concord signifi rho 
ance cance 
Psycho-Somatic Inventory S4 47.30 002 39 
Self-Rating Scal 
Affect Questionnaire 
Happiness Scale 
Thematic Apperception Test 6 31.25 10 7 
Sentence Co Test 
Word Associa Test 
Azzageddi Test 
Clinical Evaluation 39 42.40 .006 23 


Self-Rating Scale 
Affect Questionnaire 
Sentence Cor t 


yn Test 
Word Associat Test 


7 
Taylor .18 
P-S 
; 1s 42 
: 
/ 


ANTHONY DAVIDS AND HENRY PILDNER, JR. 


TABLE 8 


COMPARISON OF ALIENATION AND MALADJUSTMENT SCORES IN THE CONTROL 
AND EXPERIMENTAL GROUPS 


Control Group 
(N = 20) 
Mean Variance 


Measures 


Experimental Group 
(N= 23) F t 
Mean Variance 


Self-Rating Scale 78.60 407.52 65.30 204.04 2.29* 2.346° 
Affect Questionnaire 176.55 1,559.95 135.04 436.04 
Happiness Scale 3.98 93 4.65 57 1.604 2.56* 
Taylor Scale 20.80 63.96 11.87 47.03 1.36 3.9%" 
P-S Inventory 137.65 25,730.57 256.26 8,562.30 or 3.60°° 
scT 24.30 55-91 22.09 44.08 3.37 1.01 
WAT 61.90 58.09 63.74 36.72 1.58 .88 
TAT 22.60 163.94 19.74 171.84 1.05 -72 
Azzageddi Test 36.05 64.89 32.78 30.27 1.78 1.52 
Note.—On the Happiness Scale and P-S Inventory lower scores indicate greater unhappiness and 


neuroticism. 
* Significant at the .o5 level. 
** Significant at the .or level. 


alienation scores on the four projective 
tests. Likewise, none 
differ significantly. 


of the variances 


These results clearly demonstrate that 
the experimental subjects, motivated to 
perform on these tests according to “ma- 
ture, well adjusted” personality stand- 
ards, were successful in distorting the 
direct measures in the desired direction, 
yet in spite of this motivation, were un- 
able to alter significantly their alienation 
profile as measured by the projective 
methods. The F tests reveal significant 
clustering of scores about the mean in 
the experimental group on three of the 
five direct measures and no significant 
difference between the score variability 
of the two groups on the projective meas- 
ures. These results indicate the success- 
ful attempt of the experimental subjects 
to adhere to a normative pattern in re- 
sponse to the direct measures, and the 
comparatively unsuccessful attempt in 
response to the projective measures. 


The projective instruments, Projective 
material can be treated in many ways in 
regard to both scoring and interpreta- 
tion. Furthermore, in spite of the appar- 
ent ambiguity of the stimulus material, 


projective measures are open to distor- 
tion, particularly through inhibition of 
responses by the subject. For these 
reasons a more detailed report of the 
findings comparing the projective test 
performances of the two groups is appro- 
priate and will serve also to support the 
rationale of the projective test scoring 
system. 

It has been pointed out by Getzels that 
first person items on the Sentence Com- 
pletion test are not truly projective stim- 
uli (20). Accordingly, in this experiment 
only third person items were considered 
in computing alienation scores on the 
SCT. In order to test Getzels’ hypothesis 
the mean alienation scores on first person 
items for each group were compared. It 
was predicted that the experimental 
group would less alienation on 
first person items since these more closely 
resembled direct inquiry and therefore 
would be more susceptible to distortion. 


show 


A t of 1.85 showed a difference between 
mean scores in the predicted direction 
significant at the .o5 level. As a further 
test of this hypothesis, rank-order corre- 
lations between alienation scores on first 
and third person items were computed 
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TABLE 9 


COMPARISONS OF THE CONTROL AND EXPERIMENTAL GROUPS ON Four 
VARIABLES IN THE WAT 


Control Group Experimental Group 
(N = 20) (N = 23) t 


Level 
of Sig- 


Variable 


Mean Variance Mean Variance nificance 
Total number of responses per protocol 487.65 7,561.50 33.87 12,400.48 1.74 05 
Number of alienation responses 201.55 2,762.89 123.39 872.89 5.80 .OO1 
Number of neutral responses 164.50 3,114.89 237.91 11,738.81 2.84 .O1 
Percentage of neutral responses 33.01 50.82 53-03 165.16 6.42 .OO1 


within each group. The correlation in 
the control group was .83, whereas in the 
experimental group the correlation was 
only .44. Although this correlation of .44 
is statistically significant, it leaves ap- 
proximately 80°% of the variance unex- 
plained. From these findings it can be 
seen that first person stems on the SCT 
even when intermingled with third per- 
son stems do not act as efficiently as the 
third person stems in eliciting uncon- 
scious or inhibited material and can be 
distorted, 

Although calculated distortion of the 
Word Association Test is virtually im- 
possible, inhibition of response can be 
achieved in two ways: by restricting the 
number of responses and by deliberately 
giving neutral or unemotional responses. 
Such inhibition the 
level analyst as well as the manifest con- 


frustrates covert 
tent scorer. In order to control for inhi- 
bition on the WAT in this study, a per- 
centage alienation score was computed, 
based on the number of negative re- 
sponses given in propotrion to the total 
number of scorable responses, i.€., posi 
tive and negative responses. To test in- 
hibition of response within the experi- 
mental group, ¢ tests were made compar- 
ing the two groups on total number of 
responses per protocol, number of aliena 
tion responses, number of neutral or mis- 
cellaneous responses, and percentage of 
neutral responses. Under the assumption 


that the experimental subjects would in- 
hibit their responses, it was predicted that 
this group would make fewer total re- 
sponses, fewer alienation responses, more 
neutral responses, and would give a 
greater percentage of neutral responses. 
All the predictions were confirmed, as 
the results presented in Table g show. 
The comparatively small difference be- 
the total re- 
sponses per protocol can be explained by 
the logical supposition of a motive for 
quantitative production on the part olf 


tween mean numbers of 


the experimental subjects in an effort to 
satisfy the demands of the test. These 
results emphasize the necessity of allow- 
ing for inhibition on the written chained- 
response word association technique, 
especially under conditions which mo- 
tivate distortion, and suggest methods of 
checking for suspected inhibition or dis- 
tortion. 

A weighted percentage scoring system 
for the Azzageddi Test has been devel- 
oped by Davids (6) which controls for re- 
sponse inhibition more effectively than 
the alienation total system used in this 
study. The mean alienation scores of the 
two groups were almost equal under this 
scoring system (60.87 and 60.20). Because 
of the confusing nature of the Azzageddi 
Test and its presentation as an “auditory 
test of intuitive intelligence,” it was pre- 
dicted that both groups would give ap- 
proximately the same total number of 
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responses per protocol. A ¢ of 1.73 showed 
no statistically significant difference be- 
tween the total responses made by each 
group, although the experimental group 
gave fewer responses. 

The TAT yielded uniform results in 
both groups, Although a few subjects in 
the experimental group restricted their 
stories to bare minimum, the large ma- 
jority of stories were equal in length and 
elaboration to those given by the control 
subjects. The consistency of scoring cri- 
teria and rater judgments in interpreting 
the TAT protocols strongly reinforces 
the reliability of the results reported for 
this measure and illustrates the impor- 
tance of a full understanding of the par- 
ticular criteria and aims involved when 
dealing with TAT material requiring 
subjective interpretation. (54). 


DiscUSsSION AND CONCLUSIONS 


The results of this study demonstrate 
that personality assessment measures 
are likely to be distorted by individuals 
acting in situations such as personnel 
screening where there exists a strong mo- 
tivation to create a favorable impression. 
In comparison with a control group of 
subjects among which 
honesty was reasonably assumed, an ex- 
perimental group of subjects, performing 


test response 


in a personnel selection situation on a 
test battery identical to that administered 
to the control group in a scientific re- 
search situation, appeared far better ad- 
justed and less alienated when tested by 
direct methods, yet showed equal malad- 
justment and alienation on projective 
measures. Furthermore, the experimental 
group performed less consistently on the 
assessment measures and yielded partic- 
ularly erratic projective test perform 
ances. All the hypotheses concerning 
predicted test performance of the two 
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groups under the different motivational 
conditions were substantially confirmed. 
The evidence resulting from this investi- 
gation has several important implica- 
tions, both practical and theoretical, for 
personality assessment. 

The over-all results of this study stand 
clear and need no elaborate discussion. 
An intelligent adult male of college level 
education has little difficulty in distin- 
guishing “good” and “bad” responses to 
a personality inventory and is well able 
to appear exceptionally well adjusted 
and happy if he should so desire. Al- 
though close examination of responses 
in respect to internal consistency may dis- 
close attempted distortion, such proof is 
generally inconclusive, and the high cost 
of detailed inspection renders it virtually 
impracticable in large group testing. The 
use of indirect phrasing of questionnaire 
items as a method of preventing possible 
distortion is without certain 
hazards. Frequently in attempting to dis- 
guise the purpose of an item or series of 


also not 


questions, the resulting stimulus material 
becomes so ambiguous that responses 
may vary widely and the test ultimately 
may lose validity as a direct measure. The 
optimum adjustment in test construction 
between transparency and ambiguity is 
an exceedingly elusive point and even 
when reached may be incapable of de- 
fending against insight and intuitive 
sense. Although exceptional insight and 
sophistication may not characterize the 
majority of subjects in various assessment 
programs where good adjustment is re- 
warded, enough personality assessment 
of persons with higher educational train- 
ing is carried on in industrial, military, 
and educational settings to warrant cau- 
tion and anticipation of unusually pro 
ficient test faking. 

Even the wise and wary subjects, how- 
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ever, apparently cannot successfully dis- 
guise projective measures without dis- 
tinctly revealing their defensiveness. As 
was mentioned in the discussion of the 
Word Association Test, restriction of 
total response material is the most effec- 
tive method of thwarting projective 
measurement. Yet such restriction is im- 
mediately obvious and may even contri- 
bute to final personality appraisal. The 
calculated ambiguity of projective test 
material is ideally suited to bypass pre- 
text and social propriety and thereby 
investigate motives and attitudes which 
determine the adjustment of a person- 
ality configuration to its environmental 
setting. 

Projective testing, in spite of all its 
near magical efficacy, is inescapably 
cursed with an inherent paradox, for in 
its strength lies its greatest weakness. As 
a result of the idiosyncratic responses 
evoked by the ambiguous projective 
stimuli, the problem of standardized 
scoring schemes and, in the case of the 
Word Association Test and the Sentence 
Completion Test, standardized structure 
have perplexed investigators and stymied 
widespread use of projective techniques 
in large group assessment situations. 
Furthermore, although several of the 
important projective techniques, such as 
the Sentence Completion Test, the Word 
Association Test, and the TAT, can be 
group administered, projective tests are 
essentially personalized measures and 
lose some of their effectiveness when not 
administered individually. Thus, in addi- 
tion to scoring and structural problems, 
the necessity of individual administration 
seriously handicaps practical extension 
of projective assessment measures into 
applied testing situations where predic- 
tive ability and standardization are at a 
premium. 
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In both groups tested in the present 


study, intercorrelations between rank 


orders were lowest between projective 


and direct measures. Within the control 
group, five of the six insignificant corre- 
lations were between direct and projec- 
tive instruments, and the seven negative 
correlations in the experimental group 
involved projective measures. It must be 
noted, however, that all six intercorre- 
lations between projective measures in 
the control group were significant, four 
at the .o1 level. This lack of direct-pro- 
jective test consistency in the case of the 
control group, although not of serious 
magnitude, appears somewhat confusing 
especially in view of the careful structur- 
ing of the projective instruments in order 
to measure the same personality qualities 
dealt the 
(The 
fically structured but rather was scored 


with by direct instruments. 


PAT, of course, was not speci- 


expressly for alienation.) The crucial 
theoretical question of the level of be 
havior represented in responses to various 
types of psychodiagnostic measures is 
here relevant and should be included in 
any full consideration of intertest con- 
sistency (52). A discussion of this question 
will be presented below in connection 
with related theoretical issues. 

It is readily apparent from the above 
evidence that personality appraisal, par- 
ticularly in real life assessment situations, 
must be based on both direct and pro- 
jective measurement if maximum validity 
is to be achieved, Projective assessment 
methods, in addition to providing a 
largely foolproof means of detecting at- 
tempted distortion, are able to illuminate 
critical areas of personality which often 
elude direct measures. Although projec- 
tive measures are basically clinical tech- 
niques best suited for individual diag- 
nosis, they can be employed successfully 
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in large scale assessment settings through 
careful structuring standardized 
scoring and analysis procedures. Regard- 
less of procedural difficulties, projective 
assessment techniques can contribute a 
great deal more to the study of normal 
personality than Allport seems willing 
to admit and should not be restricted to 
the study of neurotic and disturbed per- 
sonality. 

Thus far we have been considering 
personality assessment primarily in situ- 
ations, such as industrial screening in 
which strong motivation to distort test 
responses many be present. The role 
played by situational and motivational 
factors in influencing assessment test 
performances must not be underesti- 
mated, A sharp distinction has to be 
made between the willingness of the 
individual to reveal personality charac- 
teristics and his ability to do so. Allport 
declares that the normal adult is able to 
comprehend his general personality 
make-up and can accurately describe his 
dominant motivational patterns. Hence 
simple self-report is an adequate basis 
for personality appraisal, and intricate 
investigation of unconscious sectors of 
mental functioning is largely superfluous. 
On the other hand, the neurotic is in- 
capable of self-expression, and indirect 
assessment techniques are necesary, In 
support of this claim, Allport cites the 
behavior of the neurotic group studied 
by Getzels (20). Responses to first person 
sentence completion items made by this 
group were conventionally correct and 
followed normative standards, whereas 
the responses to third person items in- 
dicated a high degree of neuroticism and 
anxiety. In contrast, the well adiusted 
group of subjects gave similar responses 
to both of the »aired direct and projec- 
tive items. However, Getzels himself 
pointed out that the neurotics were 
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anxious to appear well enough adjusted 
so that the Veterans Administration office 
would approve their requests for educa- 
tional benefits. Clearly, in such a situa- 
tion it is difficult if not impossible, to 
separate willingness from ability. Rather 
than assuming incapability of self-under- 
standing, it seems much more feasible to 
regard the direct-indirect item response 
inconsistency as a manifestation of con- 
scious defensiveness by the neurotics, 
willfully intended to disguise personality 
infirmities. The indirect items, being far 
less susceptible to distortion, were able 
to pierce the protective facade and sub- 
mit a more accurate picture. 

The findings, both in the study con- 
ducted by Davids (8) and in the present 
investigation, contradict Allport’s predic- 
tion of inconsistent direct-projective test 
performance by neurotics. The three sub- 
jects in Davids’ two experimental groups 
who showed a high degree of neuroticism 
performed consistently on all the test 
measures and were frankly aware of 
their difficulties. In the present study, the 
single individual in the experimental 
group who displayed neurotic character- 
istics revealed his maladjustment on the 
direct measures as well as the projective 
measures. Although a sample of such 
small size as provided by these two studies 
not offer a 


substantial basis for 


conclusion, it seems reasonable to assume 


does 


that neurotic defense may frequently be 
quite conscious and that a distinction 
should be made, as Davids suggests, be- 
tween the “cooperative neurotic” and the 
“defensive neurotic.” In the case of the 
former, whereas projective techniques 
may probe deep into motivational origins 
of structural unbalance and _ facilitate 
diagnosis and therapy, direct assessment 


measures likewise reveal maladjustment 


and indicate dominant neurotic trends. 
In dealing with the defensive neurotic 


be 
q 
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the question of ability to express malad- 
justment becomes crucial. In this case, 
Allport’s proposal is most tenable. Yet it 
must be remembered that extreme neu- 
rotic defensiveness which renders the in- 
dividual incapable of accurate self-report 
is often a prepsychotic symptom, and 
such behavior falls in the borderline area 
between neurosis and psychosis. Allport’s 
theory, then, of the relations between 
direct and projective test performances of 
neurotics may actually apply best to the 
prepsychotic individual who has lost a 
good deal of ego identity and introspec- 
tive ability. Further research comparing 
the test performances of neurotics on 
direct and projective measures is needed 
in order to validate more conclusively 
any proposals concerning the relation- 
ship between performance consistency 
and degree of maladjustment. 

Three conditions must be fulfilled if 
concordant direct-projective test perform- 
ance is to be expected and if direct meas- 
ures are to be used as valid indices of 
personality traits, The individuals being 
assessed must be willing to submit honest 
report. They must be able to make such 
a report. Thirdly, the dispositions or 
attributes being measured must be of a 
generally overt nature and clearly de- 
fined. In other words, the self-report of 
prepsychotic individuals on matters re- 
lating to their resolution of Oedipal 
conflicts and the consequent effects on 
later cathetic relationships could hardly 
be given serious attention as a valid per- 
sonality profile, even if the assessment 
situation favored honesty and accuracy 
was rewarded. On the other hand, it is 
perfectly credible to accept as largely 
valid the direct report of a normal or 
moderately neurotic individual on vari- 
ables such as expressed aggression or 
psychosomatic experiences in a testing 
situation in which the individual stands 


to gain nothing by distorting responses 
or may actually benefit by truthful re- 
sponse. Under such conditions, when a 
relatively specific index of behavior is 
needed or comparative magnitudes are 
required, direct techniques are thor- 
oughly sufhcient to provide the necessary 
information and can replace expensive 
projective measures. 


Many studies have inadvertently demonstrated 
this observation. In a study of facial asymmetry 
(37), Lindzey obtained a measure of neuroticism 
for 20 subjects participating in a research pro- 
gram which was based on the clinical evalua- 
tions of four psychologists. The evaluations were 
derived from interviews, life history material, 
and projective tests. The scores of the subjects 
on the Psycho-Somatic Inventory were available 
and were correlated with the combined clinical 
evaluations as an incidental measure of neuroti 
cism. The resulting correlation was significant 
beyond the .o5 level and offered a strong indica- 
tion that the economical g2-item inventory could 
serve as an index of neuroticism equally as effec- 
tive as that obtained by the more thorough, costly 
method The effectiveness of the 
P-S Inventory as a behavior index and predictor 
was again born out in another study by Lindzey 
and Newburg of signs of anxiety in the TAT 
(36). An independent measure of anxiety was 
obtained for 20 subjects who had taken the TAT. 
The measure was a “diagnostic council” rating 
based upon a variety of information de- 
rived from observation, situational tests, and a 
number of indirect measures. The council was 
composed of clinical psychologists who were 
familiar with the material which had been ac- 
cumulated. The rank-order correlation between 
the council ratings and the P-S Inventory scores 
of the 20 subjects was .69, significant well beyond 
the .o1 level. It is interesting to note that, al- 
though these investigators did not emphasize the 
agreement between direct and projective evalua 
tion methods or point out the adequacy of the 
economical P-S Inventory, evidence in support of 
these observations did result. 


of assessment. 


wide 


Further relevant evidence is supplied in a 
study by Lindzey and (38) who, in 
studying aggression signs in the TAT, found that 
self-ratings on aggression correlated highly with 
aggression FAL material. This 
study demonstrates the three conditions 
instrumental in producing test performance con- 
sistency and direct test validity. The subjects were 
in good rapport with the investigators, fully 
capable of self-report, and a well defined variable 
was being measured. 

The effectiveness of the direct approach to 
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personality and the hazards of indefinite assess- 
ment objectives are vividly demonstrated in a 
study conducted by Eriksen (17). Studying psy- 
chological stress and its personality correlates, 
Eriksen found a marked lack of relationship be- 
tween performance under stress and various 
Rorschach signs. However, information offered by 
the subjects in a personal interview proved an 
excellent indicator of stress performance. Eriksen 
comments: ‘Finally, we may note that our sub- 
jects were able to predict the direction of the 
change in their performance under stress. As- 
suming that this finding has generality, we are 
left with the amusing conclusion that, with our 
present techniques, the best way to predict how 
a man will perform under stress is to ask him” 
(17, p. 285). The failure of the Rorschach ma- 
terial to correlate with stress performance is 
explained by Eriksen as a lack of assessment 
focus. “It is not altogether surprising that little 
or no relationship is found between Rorschach 
variables and stress performance. In fact, we 
would be somewhat surprised if any single per- 
sonality measure produced a relationship with 
stress performance which was high enough to be 
practical. When one carefully analyzes the bases 
of performance under stress, the complexity of 
the problem of prediction becomes apparent. 
People can perform similarly under stress for a 
variety of different reasons. . . . The prediction 
of performance under stress requires tools that 
will measure all the pertinent variables 

[and] the first step is actually to identify what 
the relevant variables may be” (17, pp. 283-284). 


The above mentioned studies serve to 
demonstrate the value of direct assessment 
measures in making pertinent predictions 
and evaluations of certain behavior vari- 
ables. In each testing situation, the three 
conditions necessary for direct test ac- 
curacy and intertest consistency were 
operant. The need for assessment focus 
and definition of variables was especially 
well illustrated by the Eriksen study. In 
the present study the contrast between 
willingness and ability is clearly seen. 
Both groups of subjects were fully cap- 
able of accurate self-report; yet the ex- 
perimental group, tested in the employ- 
ment situation, showed considerable di- 
rect-projective test inconsistency and 
comparatively little alienation and mal- 
adjustment on direct measures. In spite 
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of the fact that the variables being 
measured were defined and consistently 
tapped by direct and projective instru- 
ments, the unwillingness of the subjects 
to reveal character weaknesses or person- 
ality “sore-spots” caused the direct meas- 
ures to be nearly valueless as accurate pre- 
dictors or devices for appraisal. The con- 
trol group, on the other hand, both will- 
ing and able to offer honest self-report 
material, performed consistently on all 
the assessment measures, and the indices 
of alienation obtained from the direct in- 
struments were in most cases highly 
correlated with composite evaluations 
based on the results of the entire assess- 
ment program. It must be emphasized 
that the variables in the alienation syn- 
drome, although closely related to dy- 
namic personality structure, are largely 
overt behavior characteristics, manifested 
on a conscious level by the individual, 
and fully within reach of introspection 
and self-appraisal. Thus, it can be con- 
cluded that, although direct assessment 
is often valuable and economical, any 
assessment results must be viewed in 
respect to the degree of willingness and 
ability of the subjects to respond accu- 
rately to assessment measures, the nature 
of the variables involved, and the extent 
to which the variables are defined. 

As was mentioned above, a critical 
factor in personality analysis is the level 
of behavior elicited by various assess- 
ment techniques. Psychoanalytic theory 
has stressed the duality, and often the 
discontinuity, of overt and unconscious 
behavior. Although surface attitudes and 
traits usually reflect the dynamic motiva- 
tional framework of the personality, the 
two components of behavior are never- 
theless distinguishable and may coexist 
at varying degrees of integration. The 
psychotic frequently demonstrates a com- 
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plete separation of conscious and uncon- 
scious, while the well adjusted individual 
achieves a harmonious relationship which 
permits satisfactory fulfillment and ex- 
pression. Generally, allowing a broad 
definition of the concept of normality, 
the normal individual maintains a con- 
tinuity between his overt behavior and 
unconscious motivation. Yet enough of a 
discrepancy exists even in the normal 
individual to make uncertain personality 
assessment through combined direct and 
projective measurement. 

Projective techniques are commonly 
believed to elicit deeper and sometimes 
repressed personality information. By im- 
posing structure on the stimulus material 
of instruments such as the Rorschach and 
the TAT, the individual may reveal un- 
conscious strivings and conflicts of which 
he is not fully aware. In response to 
direct approaches the individual may 
accurately describe conscious personality 
attributes and only vaguely suggest perti- 
nent deeper level functionings. ‘Thus, 
even though assessment focus is identified 
and defined, material gained from the 
two methods of measurement can be 
sufficiently dissimilar so as to cause con- 
cern to the investigator seeking a uniform 
personality profile. As a consequence, a 
great deal of indecision and uncertainty 
may result in an 
where 


assessment situation 
direct and indirect 
material has been obtained from normal, 


cooperative individuals. 


inconsistent 


The imaginative powers and the intri- 
cate motivational channelings in the 
human mind are by no means completely 
understood, nor can they be efficiently 
exposed by projective techniques. How- 
ever, much confusion resulting from 
slightly discrepant direct and projective 
test material is quite unnessary and can 


be eliminated by avoiding artificial sepa- 
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ration and treating all the material as es- 
sentially congruent. It is of singular im- 
portance that projective material be 
evaluated and analyzed in comparison 
with information gained from direct 
assessment. “Blind analysis” of either 
direct or projective material is decidedly 
an academic exercise and cannot be relied 
upon in an applied assessment situation. 
Only by considering the conscious level 
of behavior in relation to the unconscious 
and by observing the compensations, re- 
strictions, directions of energy, and all 
the other adaptive mechanisms involved 
in the interplay between the two com- 
ponents can any systematic knowledge 
about the integration of personality be 
gained. It may be said, then, that, while 
projective and direct personality apprais- 
al may yield different quantitative results, 
these results may actually contribute to 
the formation of an accurate qualitative 
personality appraisal. Direct and projec- 
tive assessment methods do elicit material 
representative of different levels of be- 
havior, and the relative psychodiagnostic 
value of this information must be recog- 
nized. 

In any assessment situation in which 
both direct and projective techniques are 
employed, an essentially two-dimensional 
measurement is made. Information per- 
tinent to both surface and unconscious 
personality areas is gathered and must 
be interpreted within its proper context. 
Yet in spite of the more thorough analy- 
sis which results from this dual approach 
to personality, prediction of behavior 
often becomes more complicated, par- 
ticularly in the case of normals. The pro- 
fessed attitudes of the normal individual 
change as circumstances vary and reality 
imposes new adaptation desideratum, 
whereas the neurotic in many cases fails 
to respond to changing reality conditions 
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and adheres irrationally to compulsive 
notions. The personality structure of the 
normal includes a number of motivation- 
al patterns subtly woven to permit a 
satisfactory adjustment. It is difficult to 
single out and follow through a single 
motivational thread without becoming 
entangled in others. Unconscious aggres- 
sion may be closely tied to dependency 
needs, or sexual expression may be di- 
rected both externally and narcistically. 
As a result, no one motivational trend 
will show up prominently or overbalance 
behavior. In the motivational pattern of 
the neurotic, on the other hand, a spe- 
cific thread may be awry or exaggerated 
and will consequently be more visible in 
both conscious and unconscious behavior. 
This lack of predominant motivational 
trends reflecting neurotic fixation among 
the control group in the present study 
further explains the relatively poorer 
success of quantitative comparison of 
direct and projective test performance. 
This analysis suggests the interesting con- 
clusion that the maladjusted individual 
with all his deviation and irrationality 
may be more easily analyzed than the 
normal personality. 

In conclusion, further research using a 
similar experimental design to study 
different personality variables, compare 
other assessment instruments, and_ test 
neurotic and psychotic personalities is 
greatly needed and promises to make 
valuable contributions to the under- 
standing and efficiency of personality 
assessment. 


SUMMARY 


The purpose of this study was to in- 
vestigate the relative performances on 
direct and projective personality assess- 
ment measures by subjects under differ- 
ent motivational conditions. A primary 
objective was an empirical evaluation of 
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Allport’s theoretical notion of the con- 
sistency of direct and projective assess- 
ment results and the consequent impli- 
cations of the dispensability of costly 
projective techniques in most assessment 
situations involving normals. 

A survey of the relevant literature re- 
vealed that very few studies have com- 
pared performances on direct and _pro- 
jective measures which were explicitly 
designed to certain defined 
personality variables, Also, the majority 
of studies investigating test distortion and 
fakability were “laboratory” experiments, 


focus on 


and observations were derived from per- 
formance motivated by artificial incen- 
tive. In the present study a real employ- 
ment situation was set up in which test 
performance motivation sincere 
rather than feigned. A battery of ten 


was 


personality assessment measures was as- 
sembled which included four projective 
methods, five direct methods, and a clini- 
cal evaluation by an experienced psycho- 
logist. The assessment focus was on a 
personality configuration known as the 
“alienation syndrome,” and all the in- 
struments were designed to measure the 
eight dispositions forming the syndrome 
or related maladjustment symptoms. The 
variables making up the syndrome in- 
clude three polar dispositions: optimism- 
pessimism, trust-distrust, and sociocen- 
tricity-egocentricity; and two additional 
negative or socially undesirable disposi- 
tions: resentment and anxiety. 

Iwo groups of subjects were studied, 
all of whom were male college undergrad- 
uates. The subjects acting as the con- 
trol group consisted of go undergraduates 
who had volunteered to participate in 
an extensive two-year study of person- 
ality. These men, coming from varied 
social and economic backgrounds, were 
guaranteed anonymity and encouraged 
to respond honestly to the various test 
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measures. Exceptionally good rapport 
was established. 

The experimental group consisted of 
23 undergraduates of equally varied 
backgrounds and personality make-up. 
However, unlike the control group sub- 
jects, these men did not volunteer to 
take psychological tests. The experimen- 
tal subjects were contacted through the 
college employment office where they 
were registered as seeking employment. 
They were told that a few positions were 
open as research assistants at the psycho- 
logical clinic, and, upon application, 
were informed that a personnel screening 
procedure was necessary to aid the inves- 
tigators in selecting “mature, well ad- 
justed” personalities who were well suited 
for psychological research work. Only 
the experimenters knew that the em- 
ployment situation was fictitious, and 
the experimental subjects underwent the 
test battery presumably as part of an 
employee selection procedure. 

The test measures included two self- 
rating scales, three questionnaires, a word 
association test, a sentence completion 
test, the Thematic Apperception Test, 
and an auditory projective test. All but 
three of the instruments were structured 
and scored to provide a measure of the 
alienation syndrome. The tests were ad- 
ministered in four separate experimental 
sessions, with the projective methods pre- 
ceding the direct methods. 

It was expected that the experimental 
subjects would attempt to distort the 
personality measures in an effort to ap- 
pear “mature and well adjusted.” Be- 
cause of the nature of the projective test 
material, it was predicted that the ex- 
perimental group would be unable to 
alter significantly their performance on 
the projective measures so as to appear 
exceptionally well adjusted or in this 
case, nonalienated. However, since the di- 


rect instruments were far more trans- 
parent, it was expected that they would 
be quite susceptible to distortion. Since 
the control group subjects were moti- 
vated to respond honestly to these assess- 
ment measures, the experimenters pre- 
dicted that the experimental group 
would show significantly less alienation 
on the direct measures and would not 
differ from the control group on projec- 
tive test alienation scores. Also, it was 
predicted that the experimental group 
would perform less consistently on direct 
and projective tests and therefore, in 
comparison with the group, 


would show less concordance among rank 


control 


orders based on the results of the various 
measures. 

The results confirmed the predictions 
of both intragroup test performance and 
comparative performance. Within the 
control group nearly all the 45 correla- 
tion coefficients correlating the ten meas- 
ures were 
level. All 
measuring agreement of rank orders in 


significant beyond the .o5 


coefficients of concordance, 


three groups of various direct and projec- 
tive tests, were significant at the .oo1 
level. The experimental group showed 
far less consistency of test performance. 
\pproximately one-fourth of the 45 inter- 
correlations were significant at the .o5 
level, and although 2 of the 3 coefhcients 
of concordance were significant beyond 
the .o1 level, they were of considerably 
less magnitude than those of the control 
group. Comparison of the test perform- 
ances of the two groups revealed that, 
while the experimental group admitted 
significantly less alienation on the direct 
measures, both groups showed the same 
amount of alienation on the projective 
measures. The score ranges and tests of 
homogeneity of variance indicated that 
the experimental group attempted to ad- 
here to a 


normative standard on the 


i 
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direct measures, yet were less successful 
on the projective measures. It was also 
found that, within the control group, 
neuroticism, manifest anxiety, and un- 
happiness were positively correlated with 
alienation. As a result of the erratic test 
performances within the experimental 
group, these relationships were far less 
pronounced. 

The over-all results of this study dem- 
onstrate that personality assessment mea- 
sures are likely to distorted by 
individuals undergoing assessment in 
situations such as personnel screening in 


be 


which there exists a strong motivation to 
create a favorable impression. Such dis- 
tortion is particularly likely to occur 
when the assessment subjects are of 
higher educational level and have a cor- 
responding degree of sophistication. The 
results of the projective tests indicate 
the effectiveness of projective methods 
in controlling assessment deception by 
both revealing inconsistencies indicative 
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